Application of multiple shrinkage methods to genomic predictions.

نویسندگان

  • Christian Maltecca
  • Kristen L Parker
  • Joseph P Cassady
چکیده

New challenges have arisen with the development of large marker panels for livestock species. Models easily become overparameterized when all available markers are included. Solutions have led to the development of shrinkage or regularization techniques. The objective of this study was the application and comparison of Bayesian LASSO (B-L), thick-tailed (Student-t), and semiparametric multiple shrinkage methods. The B-L and Student-t methods were also each analyzed within a single shrinkage and a multiple shrinkage framework. Simulated and real data were used to evaluate each method's performance. Real data consisted of SNP genotypes of 4,069 Holstein sires. Traits included in analysis of real data were milk, fat, protein yield, and somatic cell score. The performance of each model was compared based on correlations between true and predicted genomic predicted transmitting abilities. Model performance was also compared with the performance of routinely used methods such as Bayes-A and GBLUP through cross-validation techniques. When using simulated data regardless of shrinkage framework, shrinkage models outperformed genomic BLUP (GBLUP). The average advantage of shrinkage models ranged from 1% to approximately 8% depending on the prior specification. When analyzing real data, shrinkage models slightly outperformed GBLUP for most traits. Shrinkage models were better able to model traits for which 1 or more SNP of large effect have been identified. Overall, results suggested a relatively small advantage in multiple shrinkage models. Multiple shrinkage methods could represent a useful alternative to current methods of prediction; however, their performance in a variety of scenarios needs to be investigated further.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Positive-Shrinkage and Pretest Estimation in Multiple Regression: A Monte Carlo Study with Applications

Consider a problem of predicting a response variable using a set of covariates in a linear regression model. If it is a priori known or suspected that a subset of the covariates do not significantly contribute to the overall fit of the model, a restricted model that excludes these covariates, may be sufficient. If, on the other hand, the subset provides useful information, shrinkage meth...

متن کامل

صحت انتخاب ژنومی روش‌های پارامتری و ناپارامتری با معماری‌های ژنتیکی افزایشی و غالبیت

     In most genomic prediction studies only additive effects will be used in models for estimating genomic breeding values (GEBV). However, dominance genetic effects are an important source of variation for complex traits, considering them into account may improve the accuracy of GEBV. In the present  study,  performed applying  simulated data, the effect of  different heritability values (0.1...

متن کامل

The Impact of Different Genetic Architectures on Accuracy of Genomic Selection Using Three Bayesian Methods

Genome-wide evaluation uses the associations of a large number of single nucleotide polymorphism (SNP) markers across the whole genome and then combines the statistical methods with genomic data to predict the genetic values. Genomic predictions relieson linkage disequilibrium (LD) between genetic markers and quantitative trait loci (QTL) in a population. Methods that use all markers simultaneo...

متن کامل

Factors affecting accuracy from genomic selection in populations derived from multiple inbred lines: a Barley case study.

We compared the accuracies of four genomic-selection prediction methods as affected by marker density, level of linkage disequilibrium (LD), quantitative trait locus (QTL) number, sample size, and level of replication in populations generated from multiple inbred lines. Marker data on 42 two-row spring barley inbred lines were used to simulate high and low LD populations from multiple inbred li...

متن کامل

Effect of Markers Effect Estimation Methods, Population Structure and Trait Architercture on the Accuracy of Genomic Breeding Values

This study aimed to investigate the  effect  of  the method of estimating the effects of markers , QTLs distribution, number of QTLs, effective population size and trait heritability on the accuracy of genomic predictions. Two effective population sizes, 100 and 500 individuals, were simulated by QMSim software. A 100 cM genome including one chromosome was simulated where 500 SNPs and two diffe...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Journal of animal science

دوره 90 6  شماره 

صفحات  -

تاریخ انتشار 2012